Temporal ICA for classification of acoustic events in a kitchen environment

Authors

  • Florian Kraft
  • Robert G. Malkin
  • Thomas Schaaf
  • Alexander H. Waibel
Abstract

We describe a feature extraction method for general audio modeling using a temporal extension of Independent Component Analysis (ICA) and demonstrate its utility in the context of a sound classification task in a kitchen environment. Our approach accounts for temporal dependencies over multiple analysis frames much like the standard audio modeling technique of adding first and second temporal derivatives to the feature set. Using a real-world dataset of kitchen sounds, we show that our approach outperforms a canonical version of this standard front end, the mel-frequency cepstral coefficients (MFCCs), which has found successful application in automatic speech recognition tasks.
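The two ways of capturing temporal context mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `stack_frames` and `delta`, the toy input, the edge padding, and the simple central-difference derivative are all assumptions made for illustration. The first function builds multi-frame vectors of the kind an ICA transform could be learned on (the ICA step itself is not shown); the second appends first and second derivatives in the style of a standard front end.

```python
import numpy as np

def stack_frames(features, context=2):
    """Concatenate each frame with `context` neighbours on each side.

    features: array of shape (n_frames, n_dims).
    Returns an array of shape (n_frames, (2 * context + 1) * n_dims).
    Edges are padded by repeating the first/last frame (an assumption).
    """
    padded = np.pad(features, ((context, context), (0, 0)), mode="edge")
    n_frames = features.shape[0]
    # Window i holds the frame at offset (i - context) relative to each frame.
    windows = [padded[i:i + n_frames] for i in range(2 * context + 1)]
    return np.concatenate(windows, axis=1)

def delta(features):
    """First temporal derivative as a central difference over frames."""
    padded = np.pad(features, ((1, 1), (0, 0)), mode="edge")
    return (padded[2:] - padded[:-2]) / 2.0

# Toy "spectral" features: 6 frames, 2 dimensions each (hypothetical data).
frames = np.arange(12, dtype=float).reshape(6, 2)

# Multi-frame input: each row now spans 3 consecutive frames.
stacked = stack_frames(frames, context=1)

# Derivative-style input: static features plus first and second derivatives.
d1 = delta(frames)
d2 = delta(d1)
with_deltas = np.hstack([frames, d1, d2])
```

Both representations give each frame a view of its neighbours. The stacked form keeps the raw neighbouring values and leaves it to a learned transform to find useful temporal structure, whereas the derivative form fixes that structure in advance.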

Similar resources

Continuous Audio Object Recognition Diploma Thesis

The detection of sound events is a key technology for a wide variety of audio applications. Sounds can carry information beyond the limits of vision. Therefore, a humanoid robot assigned kitchen tasks interacts much better with its environment when it uses acoustics. While audio scene analysis covers many topics, this thesis deals with the recognition of preseg...

Improving the Performance of ICA Algorithm for fMRI Simulated Data Analysis Using Temporal and Spatial Filters in the Preprocessing Phase

Introduction: The accuracy of functional MRI (fMRI) data analysis usually decreases in the presence of noise and artifact sources. A common solution for analyzing highly noisy fMRI data is to apply suitable preprocessing methods that denoise the data. Some effects of preprocessing methods on parametric methods such as the general linear model (GLM) have previously been evalua...

Mixed acoustic events classification using ICA and subspace classifier

This paper describes a new neural architecture for unsupervised learning of a classification of mixed transient signals. This method is based on neural techniques for blind separation of sources and subspace methods. The feed-forward neural network dynamically builds and refreshes an acoustic events classification by detecting novelties, creating and deleting classes. A self-organization process a...

Daily Activity Recognition based on Meta-classification of Low-level Audio Events

This paper presents a method for recognizing activities taking place in a home environment. Audio is recorded and analysed in real time, with all computation taking place on a low-cost Raspberry Pi. In this way, data acquisition, low-level signal feature calculation, and low-level event extraction are performed without transferring any raw data out of the device. This first-level analysis produces a...

Publication date: 2005